首页> 外文OA文献 >MIRAGE: An Iterative MapReduce based FrequentSubgraph Mining Algorithm
【2h】

MIRAGE: An Iterative MapReduce based FrequentSubgraph Mining Algorithm

机译:mIRaGE:基于迭代mapReduce的Frequentsubgraph挖掘算法

摘要

Frequent subgraph mining (FSM) is an important task for exploratory dataanalysis on graph data. Over the years, many algorithms have been proposed tosolve this task. These algorithms assume that the data structure of the miningtask is small enough to fit in the main memory of a computer. However, as thereal-world graph data grows, both in size and quantity, such an assumption doesnot hold any longer. To overcome this, some graph database-centric methods havebeen proposed in recent years for solving FSM; however, a distributed solutionusing MapReduce paradigm has not been explored extensively. Since, MapReduce isbecoming the de- facto paradigm for computation on massive data, an efficientFSM algorithm on this paradigm is of huge demand. In this work, we propose afrequent subgraph mining algorithm called MIRAGE which uses an iterativeMapReduce based framework. MIRAGE is complete as it returns all the frequentsubgraphs for a given user-defined support, and it is efficient as it appliesall the optimizations that the latest FSM algorithms adopt. Our experimentswith real life and large synthetic datasets validate the effectiveness ofMIRAGE for mining frequent subgraphs from large graph datasets. The source codeof MIRAGE is available from www.cs.iupui.edu/alhasan/software/
机译:频繁的子图挖掘(FSM)是对图数据进行探索性数据分析的重要任务。多年来,已经提出了许多算法来解决该任务。这些算法假定挖掘任务的数据结构足够小,可以放入计算机的主内存中。但是,随着现实世界中图形数据的增长(无论大小还是数量),这种假设不再成立。为了克服这个问题,近年来提出了一些以图形数据库为中心的方法来解决FSM。但是,尚未广泛探索使用MapReduce范式的分布式解决方案。由于MapReduce成为事实上的海量数据计算范式,因此对这种范式的高效FSM算法有巨大的需求。在这项工作中,我们提出了一种称为MIRAGE的频繁子图挖掘算法,该算法使用基于iterativeMapReduce的框架。 MIRAGE是完整的,因为它返回给定用户定义支持的所有频繁子图,并且由于它应用了最新FSM算法采用的所有优化,因此效率很高。我们对现实生活和大型综合数据集的实验验证了MIRAGE从大型图形数据集中挖掘频繁子图的有效性。可从www.cs.iupui.edu/alhasan/software/获得MIRAGE的源代码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号